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GENERAL INFORMATION: 

(i) APPLICANT: von Schaewen, Antje Dr. rer. nat . 

(ii) TITLE OF INVENTION: Plant gntl sequences and the use thereof fo * . • 
the production of plants having reduced or lacking N- a cet^ CENTER 160WZW 

glucosaminyl transferase I {GnTI ) activity 

(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: von Schaewen, Antje Dr. rer. nat. 

(B) STREET: Natruper strasse 169a 

(C) CITY: Osnabrueck 

(D) COUNTRY: Germany 

(E) ZIP: D-49076 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA 

(A) APPLICATION NUMBER: 09/591,466 

(B) FILING DATE: 06/09/2000 

(C) CLASSIFICATION: 536 



INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Solanum tuberosum 

(B) STRAIN: Desiree 

(D) DEVELOPMENTAL STAGE: Sink organ 

(F) TISSUE TYPE: Mesophyll 

(G) CELL TYPE: Leaf cells 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Lambda ZAP II (Eco RI ) 

(B) CLONE: gntl-Al (K) 

(ix) FEATURE: 

(A) NAME /KEY : misc__f eature 

(B) LOCATION: 659. . 667 

(D) OTHER INFORMATION: /function- "Asn codon in this 
context is a potential glycosylat ion site" 
/product^ "N-glycosylation consensus sequence" 



1 /phenotype= "N-glycans modulate protein 

2 properties" 

3 /standard_name= "N-glycosylat ion site" 

4 /label- pot-CHO 

5 /note= "GnTI-coding sequences from animals do not 

6 contain this feature" 
7 

8 (ix) FEATURE: 

9 (A) NAME /KEY : CDS 

10 (B) LOCATION: 53. .1393 

2_i (C) IDENTIFICATION METHOD: experimental 

12 (D) OTHER INFORMATION: /codon_start= 53 

13 /function- "initiates complex N-glycans on 

14 secretory glycoproteins" 

15 /EC__number= 2.4.1.101 

16 /product- 

17 "beta-1, 2-N-acetylglucosaminyltransf erase I" 

18 /evidence- EXPERIMENTAL 

19 /gene- "cgl" 

2 0 /standard_name- "gntl" 

21 /label= ORF 

22 /note- "first gntl sequence from potato (unpublished)" 
23 

24 (ix) FEATURE: 

2 5 (A) NAME /KEY : 5 ' UTR 

26 (B) LOCATION: 15 .. 52 

27 

2 8 (ix) FEATURE: 

2 9 (A) NAME /KEY : 3 1 UTR 

30 (B) LOCATION: 1394 .. 1655 

31 

3 2 (ix) FEATURE: 

3 3 (A) NAME /KEY : CDS 

34 (B) LOCATION: 80. .139 

35 (d) OTHER INFORMATION: /function^ "membrane anchor (amino 

36 acids 10-29) " 

3 7 /product= "hydrophobic amino acid stretch in GnTI" 
33 /standard_name= "membrane anchor of a type II 

39 Golgi protein" 

4 0 /note= "identified by comparison with GnTI sequences 
41 from animals" 

42 

4 3 (ix) FEATURE: 

44 (A) NAME/KEY: misc_feature 

45 (B) LOCATION: 1 14 

46 (D) OTHER INFORMATION: /function- "used for cloning the 

47 cDNA library in Lambda ZAPII" 

4 8 /product= "EcoRI /Notl-cDNA adapter" 

49 /number- 1 

50 

51 (ix) FEATURE: 

52 (A) NAME /KEY : misc_feature 

53 (B) LOCATION: 1656. . 1669 

54 (D) OTHER INFORMATION: /product- "EcoRI /Notl-cDNA adapter" 

55 /number= 2 
56 

57 

58 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



59 
60 
61 
62 
63 



GAATTCGCGG CCGCCTGAGA AACCCTCGAA TTCAATTTCG CATTTGGCAG AG ATG 55 

Met 



1 



2 CTT TAC CGC TCA GAC TTT TTT CCT GGT CTT GGA TGG ATG CTT TCA AAA 871 

3 Leu Tvr Arq Ser Asp Phe Phe Pro Gly Leu Gly Trp Met Leu Ser Lys 

4 260 265 270 

6 TCA ACT TGG TCC GAA CTA TCT CCA AAG TGG CCA AAG GCT TAC TGG GAT 919 

7 Ser Thr Trp Ser Glu Leu Ser Pro Lys Trp Pro Lys Ala Tyr Trp Asp 

8 275 280 285 
9 

10 GAC TGG CTA AGG CTG AAA GAA AAT CAC AGA GGT CGA CAA TTT ATT CGC 967 

11 Asp Trp Leu Arg Leu Lys Glu Asn His Arg Gly Arg Gin Phe lie Arg 

12 290 295 300 305 

14 CCA GAA GTT TGC AGA ACG TAC AAT TTT GGT GAG CAT GGT TCT AGT TTG 1015 

15 Pro Glu Val Cys Arg Thr Tyr Asn Phe Gly Glu His Gly Ser Ser Leu 

16 310 315 320 
17 

18 GGG CAG TTT TTT AAG CAG TAT CTT GAG CCA ATT AAG CTA AAT GAT GTC 1063 

19 Gly Gin Phe Phe Lys Gin Tyr Leu Glu Pro lie Lys Leu Asn Asp Val 

20 325 330 335 
2 1 

22 CAG GTT GAT TGG AAG TCA ATG GAC CTA AGT TAC CTT TTG GAG GAC AAC 1111 

23 Gin Val Asp Trp Lys Ser Met Asp Leu Ser Tyr Leu Leu Glu Asp Asn 

24 340 345 350 
25 

2 6 TAT GTG AAA CAC TTT GGC GAC TTG GTT AAA AAG GCT AAG CCC ATC CAC 115 9 

27 Tyr Val Lys His Phe Gly Asp Leu Val Lys Lys Ala Lys Pro lie His 

28 355 360 365 
29 

3 0 GGA GCT GAT GCT GTT TTG AAA GCA TTT AAC ATA GAT GGT GAT GTG CGT 1207 

31 Gly Ala Asp Ala Val Leu Lys Ala Phe Asn He Asp Gly Asp Val Arg 

32 370 375 380 385 
33 

3 4 ATT CAG TAC AGA GAC CAA CTA GAC TTT GAA GAT ATC GCT CGA CAG TTT 125 5 

3 5 He Gin Tyr Arg Asp Gin Leu Asp Phe Glu Asp He Ala Arg Gin Phe 
36 390 395 400 

37 

3 8 GGC ATT TTT GAA GAA TGG AAG GAT GGT GTA CCA CGG GCA GCA TAT AAA 1303 

3 9 Gly He Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Ala Ala Tyr Lys 
40 405 410 415 

41 

42 GGG ATA GTA GTT TTC CGG TTT CAA ACA TCT AGA CGT GTG TTC CTT GTT 1351 

43 Gly He Val Val Phe Arg Phe Gin Thr Ser Arg Arg Val Phe Leu Val 

44 420 425 430 
45 

4 6 TCC CCT GAT TCT CTT CGA CAA CTT GGA GTT GAA GAT ACT TAG 1393 

47 Ser Pro Asp Ser Leu Arg Gin Leu Gly Val Glu Asp Thr * 

48 435 440 445 

4 9 

50 CG AAG AT ATG ATTGGAGCCT GAG C AAC AAT TTAGACTTAT TTGGTAGGAT ACATTTGAAA 14 53 

5 1 

52 GAGCTGACAC GAAAAGTATG ACTACCAGTA GCT AC AT GCA AC AT TTT AAT GTTAATGGAA 1513 
53 

54 G G AAC C CAC T GCTTATTGTT GGAATGGATG AAT CAT CAC C ACATCCTATT ATTCAAGTTT 157 3 

55 

5 6 ACAAACATAA AG AG G AAAT G TTGCCCTATA AAAACAAATT TTTTGTTTCT AAG AAG G AAC 1633 
57 

58 GTTACGATTA TGAGCAACTT TGGCGGCCGC GAATTC 1669 
59 

6 0 (2) INFORMATION FOR SEQ ID NO: 2: 
61 

6 2 (i) SEQUENCE CHARACTERISTICS: 

6 3 (A) LENGTH: 44 6 amino acids 



5 



1 (B) TYPE: amino acid 

2 (D) TOPOLOGY: linear 
3 

4 (ii) MOLECULE TYPE: protein 

5 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

7 Met Arq Gly Asn Lys Phe Cys Phe Asp Leu Arg Tyr Leu Leu Val Val 
15 10 15 



Ala Ala Leu Ala Phe He Tyr He Gin Met Arg Leu Phe Ala Thr Gin 
20 25 30 



8 
9 
10 
11 
12 

13 Ser Glu Tyr Val Asp Arg Leu Ala Ala Ala He Glu Ala Glu Asn His 

14 35 40 45 
15 

16 Cys Thr Ser Gin Thr Arg Leu Leu He Asp Lys He Ser Gin Gin Gin 

17 50 55 60 
18 

19 Gly Arg Val Val Ala Leu Glu Glu Gin Met Lys His Gin Asp Gin Glu 

20 65 70 75 80 
21 

22 Cys Arg Gin Leu Arg Ala Leu Val Gin Asp Leu Glu Ser Lys Gly He 

23 85 90 95 
24 

25 Lys Lys Leu He Gly Asp Val Gin Met Pro Val Ala Ala Val Val Val 

26 100 105 HO 
27 

2 8 Met Ala Cys Ser Arg Thr Asp Tyr Leu Glu Arg Thr He Lys Ser He 
29 H5 120 125 

30 

31 Leu Lys Tyr Gin Thr Ser Val Ala Ser Lys Tyr Pro Leu Phe He Ser 

32 130 135 140 
33 

34 Gin Asp Gly Ser Asn Pro Asp Val Arg Lys Leu Ala Leu Ser Tyr Gly 

35 145 150 155 160 
36 

3 7 Gin Leu Thr Tyr Met Gin His Leu Asp Tyr Glu Pro Val His Thr Glu 
38 165 170 175 

39 

4 0 Arg Pro Gly Glu Leu Val Ala Tyr Tyr Lys He Ala Arg His Tyr Lys 
41 180 185 190 

42 

43 Trp Ala Leu Asp Gin Leu Phe His Lys His Asn Phe Ser Arg Val He 

44 195 200 205 
45 

4 6 He Leu Glu Asp Asp Met Glu He Ala Ala Asp Phe Phe Asp Tyr Phe 

47 210 215 220 

48 

4 9 Glu Ala Gly Ala Thr Leu Leu Asp Arg Asp Lys Ser He Met Ala He 
50 225 230 235 240 

51 

52 Ser Ser Trp Asn Asp Asn Gly Gin Arg Gin Phe Val Gin Asp Pro Asp 

53 245 250 255 
54 

55 Ala Leu Tyr Arg Ser Asp Phe Phe Pro Gly Leu Gly Trp Met Leu Ser 

56 260 265 270 
57 

58 Lys Ser Thr Trp Ser Glu Leu Ser Pro Lys Trp Pro Lys Ala Tyr Trp 

59 275 280 285 
60 

61 Asp Asp Trp Leu Arg Leu Lys Glu Asn His Arg Gly Arg Gin Phe He 

62 290 295 300 
63 



6 



1 Arg Pro Glu Val Cys Arg Thr Tyr Asn Phe Gly Glu His Gly Ser Ser 

2 305 310 315 320 
3 

4 Leu Gly Gin Phe Phe Lys Gin Tyr Leu Glu Pro He Lys Leu Asn Asp 

5 325 330 335 
6 

7 Val Gin Val Asp Trp Lys Ser Met Asp Leu Ser Tyr Leu Leu Glu Asp 

8 340 345 350 
9 

10 Asn Tyr Val Lys His Phe Gly Asp Leu Val Lys Lys Ala Lys Pro He 

11 355 360 365 
12 

13 His Gly Ala Asp Ala Val Leu Lys Ala Phe Asn He Asp Gly Asp Val 

14 370 375 380 
15 

16 Arg lie Gin Tyr Arg Asp Gin Leu Asp Phe Glu Asp He Ala Arg Gin 

17 385 390 395 400 
18 

19 Phe Gly He Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Ala Ala Tyr 

20 405 410 415 
21 

22 Lys Gly He Val Val Phe Arg Phe Gin Thr Ser Arg Arg Val Phe Leu 

23 420 425 430 
24 

2 5 Val Ser Pro Asp Ser Leu Arg Gin Leu Gly Val Glu Asp Thr 
26 435 440 445 

27 

2 8 (2) INFORMATION FOR SEQ ID NO: 3: 
29 

3 0 (i) SEQUENCE CHARACTERISTICS: 

31 (A) LENGTH: 1737 base pairs 

32 (B) TYPE: nucleic acid 

3 3 (C) STRANDEDNESS : double 

34 ( D) TOPOLOGY: linear 

35 

3 6 (ii) MOLECULE TYPE: cDNA to mRNA 

37 

3 8 (iii) HYPOTHETICAL: NO 
39 

40 (iv) ANTI-SENSE: NO 

41 

42 (vi) ORIGINAL SOURCE: 

43 (A) ORGANISM: Nicotiana tabacum 

44 (B) STRAIN: Samsun NN 

4 5 (D)' DEVELOPMENTAL STAGE: Sink organ 

46 ( F) TISSUE TYPE: Mesophyll 

47 (G) CELL TYPE: Leaf cells 
48 

4 9 (vii) IMMEDIATE SOURCE: 

5 0 (A) LIBRARY: Lambda ZAP II (Eco RI ) 
51 (B) CLONE: gntI-A9{T) 

52 

53 (ix) FEATURE: 

54 (A) NAME /KEY : misc_feature 

55 (B) LOCATION: 733. .741 

56 (D) OTHER INFORMATION: /function= "Asn codon in this 

57 context is a potential glycosylation site" 

58 /product= "N-glycosylation consensus sequence" 

59 /phenotype- "N-glycans modulate protein 

60 properties" 

61 /standard_name= "N-glycosylation site" 

62 /label= pot-CHO 

6 3 /note= "GnTI sequences from animals do not contain this 
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1 feature" 
2 

3 (ix) FEATURE: 

4 (A) NAME /KEY : CDS 

5 (B) LOCATION: 127 14 67 

6 (C) IDENTIFICATION METHOD: experimental 

7 (D) OTHER INFORMATION: /codon_start= 127 

8 /function= "initiates complex N-glycans on 

9 secretory glycoproteins" 

10 /EC_number= 2.4.1.101 

11 /product^ 

12 "beta-1, 2-N-acetylglucosaminyltransf erase 1 

13 /evidence- EXPERIMENTAL 

14 /gene= "cgl" 

15 /standard_name= "gntl" 

16 /label= ORF „ 

17 / no te= "first gntl sequence from tobacco (unpublished) 

18 

19 (ix) FEATURE: 

2 0 (A) NAME/KEY: 5 ' UTR 

21 (B) LOCATION: 15 126 

2 2 

2 3 (ix) FEATURE: 

24 (A) NAME /KEY : 3 ' UTR 

25 (B) LOCATION: 1468. .1723 

26 

2 7 (ix) FEATURE: 

28 (A) NAME/KEY: CDS 

29 (B) LOCATION: 154 .. 213 

30 (D) OTHER INFORMATION: /function- "membrane anchor (ammo 

31 acids 10-29) " , 

32 /product^ "hydrophobic amino acid stretch xn GnTI 

33 /standard_name= "membrane anchor of a type II 

34 golgi protein" 
35 

3 6 (ix) FEATURE: 

37 (A) NAME /KEY : misc_feature 

3 8 (B) LOCATION: 1 .. 14 

39 (D) OTHER INFORMATION: /function= "use for cloning the 

4 0 cDNA library in Lambda ZAPII" 

41 /product= "EcoRI/Notl-cDNA adapter" 

4 2 /number- 1 

43 

44 (ix) FEATURE: 

45 (A) NAME /KEY : misc_feature 

46 ( B) LOCATION: 1724 .. 1737 

47 (D) OTHER INFORMATION: /product- "EcoRI/Notl-cDNA adapter 
43 /number= 2 

49 
50 

51 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

53 GAATTCGCGG CCGCCATTGA CTTGATCCTA ACTGAACAGG CAAAGTAAAT CCAGCGATGA 60 

55 AACACTCATA ACTGAACACT GAGAGACTAT TCGCTTTCTC CTAAAGCCTT CAATCGAATT 120 

57 CGCACG ATG AGA GGG AAC AAG TTT TGC TGT GAT TTC CGG TAG CTC CTC 168 

co M et Arg Gly Asn Lys Phe Cys Cys Asp Phe Arg Tyr Leu Leu 

59 450 455 460 

61 ATC TTG GCT GCT GTC GCC TTC ATC TAC ACA CAG ATG CGG CTT TTT GCG 216 

62 He Leu Ala Ala Val Ala Phe He Tyr Thr Gin Met Arg Leu Phe Ala 

63 465 470 475 



360 



408 



2 ACA CAG TCA GAA TAT GCA GAT CGC CTT GCT GCT GCA ATT GAA GCA GAA 2 64 

3 Thr Gin Ser Glu Tyr Ala Asp Arg Leu Ala Ala Ala He Glu Ala Glu 

4 480 485 490 

6 AAT CAT TGT ACA AGC CAG ACC AGA TTG CTT ATT GAC CAG ATT AGC CTG 312 

7 Asn His Cys Thr Ser Gin Thr Arg Leu Leu He Asp Gin He Ser Leu 

8 495 500 505 

10 CAG CAA GGA AGA ATA GTT GCT CTT GAA GAA CAA ATG AAG CGT CAG GAC 

11 Gin Gin Gly Arg He Val Ala Leu Glu Glu Gin Met Lys Arg Gin Asp 

12 510 515 520 525 

14 CAG GAG TGC CGA CAA TTA AGG GCT CTT GTT CAG GAT CTT GAA AGT AAG 

15 Gin Glu Cys Arg Gin Leu Arg Ala Leu Val Gin Asp Leu Glu Ser Lys 

16 530 535 540 

18 GGC ATA AAA AAG TTG ATC GGA AAT GTA CAG ATG CCA GTG GCT GCT GTA 4 56 

19 Glv He Lys Lys Leu He Gly Asn Val Gin Met Pro Val Ala Ala Val 

20 545 550 555 

22 GTT GTT ATG GCT TGC AAT CGG GCT GAT TAC CTG GAA AAG ACT ATT AAA 5 04 

2 3 Val Val Met Ala Cys Asn Arg Ala Asp Tyr Leu Glu Lys Thr He Lys 
24 560 565 570 

2 6 TCC ATC TTA AAA TAC CAA ATA TCT GTT GCG TCA AAA TAT CCT CTT TTC 552 

27 Ser He Leu Lys Tyr Gin He Ser Val Ala Ser Lys Tyr Pro Leu Phe 

28 575 580 585 



600 



648 



696 



3 0 ATA TCC CAG GAT GGA TCA CAT CCT GAT GTC AGG AAG CTT GCT TTG AGC 

31 He Ser Gin Asp Gly Ser His Pro Asp Val Arg Lys Leu Ala Leu Ser 

32 590 595 600 605 

34 TAT GAT CAG CTG ACG TAT ATG CAG CAC TTG GAT TTT GAA CCT GTG CAT 

3 5 Tyr Asp Gin Leu Thr Tyr Met Gin His Leu Asp Phe Glu Pro Val His 
36 610 615 620 

3 8 ACT GAA AGA CCA GGG GAG CTG ATT GCA TAC TAC AAA ATT GCA CGT CAT 

3 9 Thr Glu Arg Pro Gly Glu Leu He Ala Tyr Tyr Lys He Ala Arg His 
40 625 630 635 

4 2 TAC AAG TGG GCA TTG GAT CAG CTG TTT TAC AAG CAT AAT TTT AGC CGT 744 
4 3 Tyr Lys Trp Ala Leu Asp Gin Leu Phe Tyr Lys His Asn Phe Ser Arg 

44 640 645 650 

4 6 GTT ATC ATA CTA GAA GAT GAT ATG GAA ATT GCC CCT GAT TTT TTT GAC 

47 Val He He Leu Glu Asp Asp Met Glu He Ala Pro Asp Phe Phe Asp 

48 655 660 665 

5 0 TTT TTT GAG GCT GGA GCT ACT CTT CTT GAC AGA GAC AAG TCG ATT ATG 

51 Phe Phe Glu Ala Gly Ala Thr Leu Leu Asp Arg Asp Lys Ser He Met 

52 670 675 680 685 

54 GCT ATT TCT TCT TGG AAT GAC AAT GGA CAA ATG CAG TTT GTC CAA GAT 

5 5 Ala He Ser Ser Trp Asn Asp Asn Gly Gin Met Gin Phe Val Gin Asp 
56 690 695 700 

58 CCT TAT GCT CTT TAC CGC TCA GAT TTT TTT CCC GGT CTT GGA TGG ATG 

59 Pro Tyr Ala Leu Tyr Arg Ser Asp Phe Phe Pro Gly Leu Gly Trp Met 

60 705 710 715 

62 CTT TCA AAA TCT ACT TGG GAC GAA TTA TCT CCA AAG TGG CCA AAG GCT 

63 Leu Ser Lys Ser Thr Trp Asp Glu Leu Ser Pro Lys Trp Pro Lys Ala 



792 



840 



888 



936 



984 



1 720 725 730 

3 TAC TGG GAC GAC TGG CTA AGA CTC AAA GAG AAT CAC AGA GGT CGA CAA 10 32 

4 Tyr Trp Asp Asp Trp Leu Arg Leu Lys Glu Asn His Arg Gly Arg Gin 

5 735 740 745 

7 TTT ATT CGC CCA GAA GTT TGC AGA ACA TAT AAT TTT GGT GAG CAT GGT 1080 

8 Phe He Arg Pro Glu Val Cys Arg Thr Tyr Asn Phe Gly Glu His Gly 

9 750 755 760 /bb 

11 TCT AGT TTG GGG CAG TTT TTC AAG CAG TAT CTT GAG CCA ATT AAA CTA 1128 

12 Ser Ser Leu Gly Gin Phe Phe Lys Gin Tyr Leu Glu Pro He Lys Leu 

13 770 775 780 

15 AAT GAT GTC CAG GTT GAT TGG AAG TCA ATG GAC CTT AGT TAC CTT TTG 117 6 

16 Asn Asp Val Gin Val Asp Trp Lys Ser Met Asp Leu Ser Tyr Leu Leu 

17 785 790 795 

19 GAG GAC AAT TAC GTG AAA CAC TTT GGT GAC TTG GTT AAA AAG GCT AAG 1224 

2 0 Glu Asp Asn Tyr Val Lys His Phe Gly Asp Leu Val Lys Lys Ala Lys 
21 800 805 810 

23 CCC ATC CAT GGA GCT GAT GCT GTC TTG AAA GCA TTT AAC ATA GAT GGT 1272 

24 Pro He His Gly Ala Asp Ala Val Leu Lys Ala Phe Asn He Asp Gly 

25 815 820 825 

27 GAT GTG CGT ATT CAG TAC AGA GAT CAA CTA GAC TTT GAA AAT ATC GCA 1320 

2 8 Asp Val Arg lie Gin Tyr Arg Asp Gin Leu Asp Phe Glu Asn He Ala 
29 830 835 840 845 

31 CGG CAA TTT GGC ATT TTT GAA GAA TGG AAG GAT GGT GTA CCA CGT GCA 13 68 

32 Arg Gin Phe Gly He Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Ala 

33 850 855 860 

3 5 GCA TAT AAA GGA ATA GTA GTT TTC CGG TAC CAA ACG TCC AGA CGT GTA 1416 

3 6 Ala Tyr Lys Gly He Val Val Phe Arg Tyr Gin Thr Ser Arg Arg Val 
37 865 870 875 

3 9 TTC CTT GTT GGC CAT GAT TCG CTT CAA CAA CTC GGA ATT GAA GAT ACT 14 64 

40 Phe Leu Val Gly His Asp Ser Leu Gin Gin Leu Gly He Glu Asp Thr 

41 880 885 890 

4 3 TAA C AAAGAT AT G ATTGCAGGAG CCCGGGCAAA ATTTTTGACT TATTGGGTAG 1517 
44 * 

45 

47 GATGCATCGA GCT GAC ACT A AACCATGATT TTACCAGTTA CATACAACGT TTTAATGTTA 1577 

4 9 TACGGAGGAG CTCACTGTTC TAGTGTTGAA GGGATATCGG CTTCTTAGTA TTG GAT G AAT 1637 

51 CATCAACACA ACCTATTATT TTAAGTGTTC AGAACATAAA GAGGAAATGT AGCCCTGTAA 1697 
52 

53 AGACTATACA TGGGACCATC ATAATCGCGG CCGCGAATTC 
54 
55 

56 (2) INFORMATION FOR SEQ ID NO: 4: 
57 

58 (i) SEQUENCE CHARACTERISTICS: 

59 (A) LENGTH: 44 6 amino acids 

60 (B) TYPE: amino acid 
51 (D) TOPOLOGY: linear 
62 

63 (ii) MOLECULE TYPE: protein 



1737 
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1 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

3 Met Arg Gly Asn Lys Phe Cys Cys Asp Phe Arg Tyr Leu Leu lie Leu 

4 1 5 10 1= 

6 Ala Ala Val Ala Phe lie Tyr Thr Gin Met Arg Leu Phe Ala Thr Gin 

7 20 25 30 
3 

9 Ser Glu Tyr Ala Asp Arg Leu Ala Ala Ala He Glu Ala Glu Asn His 
10 35 40 45 

12 Cys Thr Ser Gin Thr Arg Leu Leu He Asp Gin He Ser Leu Gin Gin 

13 50 55 60 
14 

15 Gly Arg He Val Ala Leu Glu Glu Gin Met Lys Arg Gin Asp Gin Glu 

16 65 70 75 80 
17 

18 Cvs Arq Gin Leu Arg Ala Leu Val Gin Asp Leu Glu Ser Lys Gly He 

19 85 90 95 
20 

21 Lvs Lys Leu He Gly Asn Val Gin Met Pro Val Ala Ala Val Val Val 

22 100 105 HO 
23 

24 Met Ala Cys Asn Arg Ala Asp Tyr Leu Glu Lys Thr He Lys Ser He 

25 115 120 125 

2 6 

27 Leu Lys Tyr Gin He Ser Val Ala Ser Lys Tyr Pro Leu Phe He Ser 

28 130 135 140 
29 

3 0 Gin Asp Gly Ser His Pro Asp Val Arg Lys Leu Ala Leu Ser Tyr Asp 
31 145 150 155 160 
32 

3 3 Gin Leu Thr Tyr Met Gin His Leu Asp Phe Glu Pro Val His Thr Glu 
34 165 170 175 

35 

3 6 Arq Pro Gly Glu Leu He Ala Tyr Tyr Lys He Ala Arg His Tyr Lys 
37 180 185 190 

3 8 

3 9 Trp Ala Leu Asp Gin Leu Phe Tyr Lys His Asn Phe Ser Arg Val He 
40 195 200 205 

41 
42 



He Leu Glu Asp Asp Met Glu He Ala Pro Asp Phe Phe Asp Phe Phe 
43 210 215 220 

44 

45 Glu Ala Gly Ala Thr Leu Leu Asp Arg Asp Lys Ser He Met Ala lie 

46 225 230 235 240 
47 

48 Ser Ser Trp Asn Asp Asn Gly Gin Met Gin Phe Val Gin Asp Pro Tyr 

49 245 250 255 
50 

51 Ala Leu Tyr Arg Ser Asp Phe Phe Pro Gly Leu Gly Trp Met Leu Ser 

52 260 265 270 
53 

54 Lvs Ser Thr Trp Asp Glu Leu Ser Pro Lys Trp Pro Lys Ala Tyr Trp 

55 275 280 285 

57 Asp Asp Trp Leu Arg Leu Lys Glu Asn His Arg Gly Arg Gin Phe He 

58 290 295 300 

5 9 

6 0 Arg Pro Glu Val Cys Arg Thr Tyr Asn Phe Gly Glu His Gly Ser Ser 
61 305 310 315 320 
6 2 

6 3 Leu Gly Gin Phe Phe Lys Gin Tyr Leu Glu Pro He Lys Leu Asn Asp 
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1 325 330 335 

2 

3 Val Gin Val Asp Trp Lys Ser Met Asp Leu Ser Tyr Leu Leu Glu Asp 

4 340 345 350 
5 

6 Asn Tyr Val Lys His Phe Gly Asp Leu Val Lys Lys Ala Lys Pro lie 

7 355 360 365 
8 

9 His Gly Ala Asp Ala Val Leu Lys Ala Phe Asn He Asp Gly Asp Val 
10 370 375 380 

11 

12 Arg He Gin Tyr Arg Asp Gin Leu Asp Phe Glu Asn lie Ala Arg Gin 

13 385 390 395 400 
14 

15 Phe Gly He Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Ala Ala Tyr 

16 405 410 415 
17 

18 Lys Gly He Val Val Phe Arg Tyr Gin Thr Ser Arg Arg Val Phe Leu 

19 420 425 430 
20 

21 Val Gly His Asp Ser Leu Gin Gin Leu Gly He Glu Asp Thr 

22 435 440 445 
23 

24 

2 5 (2) INFORMATION FOR SEQ ID NO: 5: 
26 

2 7 (i) SEQUENCE CHARACTERISTICS: 

28 (A) LENGTH: 1854 base pairs 

29 (B) TYPE: Nucleotide 

3 0 (C) STRANDEDNESS : double 
31 (D) TOPOLOGY: linear 

32 

3 3 (ii) MOLECULE TYPE: cDNA to mRNA 

34 

35 (iii) HYPOTHETICAL: No 

36 

37 (iv) ANTI-SENSE: No 

38 

3 9 (vi) ORIGINAL SOURCE: 

4 0 (A) ORGANISM: Arabidopsis thaliana 
41 (B) STRAIN: Columbia 

4 2 ( D ) DEVELOPMENTAL STAGE: Mature plants 

43 (F) TISSUE TYPE: All tissues 

44 

4 5 (vii) IMMEDIATE SOURCE: 

46 (A) LIBRARY: Lambda Uni-ZAP (EcoRI/XhoI) and 

4 7 Lambda ACT (Xhol) 

48 (B) CLONE: pBSK-Ara-Gnt I-f ull #8 

49 

50 (ix) FEATURE: 

51 (A) NAME /KEY : misc_feature 

52 (B) LOCATION: 1185 .. 1193 

53 (D) OTHER INFORMATION: /function= "Asn Codon is a 

54 potential glycosylation site" 

55 /product^ "Consensus sequence for 

56 N- glycosylation" 

57 /phenotype= "N glycans modulate 

58 protein characteristics" 

59 /standard_name= "N glycosylation site" 

60 /label= pot-CHO 

61 /note= "absent in animal GnTI sequences" 
62 

63 (ix) FEATURE: 
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1 (A) NAME /KEY : CDS 

2 (B) LOCATION: 135 14 69 

3 (C) IDENTIFICATION METHOD: experimental 

4 (D) OTHER INFORMATION : /codon_st art- 135 

5 /function- "initiates complex N glycans on 

6 secretory glycoproteins" 

7 /EC_number= 2.4.1.101 

8 /product= 

9 "beta-1, 2-N-acetyl glucosaminyl transferase I" 

10 /evidence- EXPERIMENTAL 

11 /gene= "cgl" 

12 /standard_name- "gntl" 

13 /label= ORF 

14 /note- "first gntl sequence from Arabidopsis 

15 (unpublished) " 
16 

17 (ix) FEATURE: 

18 (A) NAME /KEY : 5 1 UTR 

19 (B) LOCATION: 19 134 
20 

21 (ix) FEATURE: 

2 2 (A) NAME/KEY : 3 ' UTR 

23 (B) LOCATION: 1470 .. 1848 

24 

2 5 (ix) FEATURE: 

2 6 (A) NAME /KEY : CDS 

27 (B) LOCATION: 157 .. 215 

2 8 (D) OTHER INFORMATION: /function= "membrane anchor 
29 (amino acids 8-27)" 

3 0 /product- "hydrophobic amino-acid region in 
31 GnTI" 

3 2 ' /standard_name= "membrane anchor of a Type II 

33 Golgi protein" 

34 /note- "identified by comparison with animal GnTI 

35 sequenzes " 
36 

3 7 (ix) FEATURE: 

3 8 (A) NAME /KEY : misc_feature 

39 (B) LOCATION: 1. . 18 

40 (D) OTHER INFORMATION: /function- "for preparation 

41 of a cDNA library in Lambda ACT" 

4 2 /product- "XhoI-cDNA-Adaptor " 
4 3 /number— 1 

44 

4 5 (ix) FEATURE: 

46 (A) NAME/KEY: misc_feature 

47 (B) LOCATION: 1849 .. 1854 

48 (D) OTHER INFORMATION: /product= "XhoI-cDNA- Adaptor " 

4 9 / number = 2 
50 

51 

52 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

54 CTCGAGGCCA CGAAGGCCAC CGTTTTTGTT ATAACGAACG ACACCGTTTC AAACAACTTC 60 
55 

5 6 CTTATTAGCT AGCTCCCTCC CGGCGGCAAA CACCAGAAGA TCCACCGCTT TTGATCTGGT 120 
57 

5 8 TGTTTGTCGT CGAT ATG GCG AGG ATC TCG TGT GAC TTG AGA TTT CTT CTC 17 0 

59 Met Ala Arg lie Ser Cys Asp Leu Arg Phe Leu Leu 

60 1 5 10 
61 

6 2 ATC CCG GCA GCT TTC ATG TTC ATC TAC ATC CAG ATG AGG CTT TTC CAG 218 
63 He Pro Ala Ala Phe Met Phe He Tyr He Gin Met Arg Leu Phe Gin 



15 20 25 

ACG CAA TCA CAG TAT GCA GAT CGC CTC AGT TCC GCT ATC GAA TCT GAG 
Thr Gin Ser Gin Tyr Ala Asp Arg Leu Ser Ser Ala lie Glu Ser Glu 
30 35 40 

AAC CAT TGC ACT AGT CAA ATG CGA GGC CTC ATA GAT GAA GTT AGC ATC 
Asn His Cys Thr Ser Gin Met Arg Gly Leu He Asp Glu Val Ser He 
45 50 55 60 

AAA CAG TCG CGG ATT GTT GCC CTC GAA GAT ATG AAG AAC CGC CAG GAC 
Lvs Gin Ser Arg He Val Ala Leu Glu Asp Met Lys Asn Arg Gin Asp 
^ 65 70 75 

GAA GAA CTT GTG CAG CTT AAG GAT CTA ATC CAG ACG TTT GAA AAA AAA 
Glu Glu Leu Val Gin Leu Lys Asp Leu He Gin Thr Phe Glu Lys Lys 
80 85 90 

GGA ATA GCA AAA CTC ACT CAA GGT GGA CAG ATG CCT GTG GCT GCT GTA 
Gly He Ala Lys Leu Thr Gin Gly Gly Gin Met Pro Val Ala Ala Val 
95 100 105 

GTG GTT ATG GCC TGC AGT CGT GCA GAC TAT CTT GAA AGG ACT GTT AAA 
Val Val Met Ala Cys Ser Arg Ala Asp Tyr Leu Glu Arg Thr Val Lys 
110 115 120 

TCA GTT TTA ACA TAT CAA ACT CCC GTT GCT TCA AAA TAT CCT CTA TTT 
Ser Val Leu Thr Tyr Gin Thr Pro Val Ala Ser Lys Tyr Pro Leu Phe 
125 130 135 140 

ATA TCT CAG GAT GGA TCT GAT CAA GCT GTC AAG AGC AAG TCA TTG AGC 
He Ser Gin Asp Gly Ser Asp Gin Ala Val Lys Ser Lys Ser Leu Ser 
145 150 155 

TAT AAT CAA TTA ACA TAT ATG CAG CAC TTG GAT TTT GAA CCA GTG GTC 
Tvr Asn Gin Leu Thr Tyr Met Gin His Leu Asp Phe Glu Pro Val Val 
Y 160 165 170 

ACT GAA AGG CCT GGT GAA CTG ACT GCG TAC TAC AAG ATT GCA CGT CAC 
Thr Glu Arg Pro Gly Glu Leu Thr Ala Tyr Tyr Lys He Ala Arg His 
175 180 185 

TAC AAG TGG GCA CTG GAC CAG TTG TTT TAC AAA CAC AAA TTT AGT CGA 
Tvr Lys Trp Ala Leu Asp Gin Leu Phe Tyr Lys His Lys Phe Ser Arg 
190 195 200 

GTG ATT ATA CTA GAA GAC GAT ATG GAA ATT GCT CCA GAC TTC TTT GAT 
Val He He Leu Glu Asp Asp Met Glu He Ala Pro Asp Phe Phe Asp 
205 210 215 220 

TAC TTT GAG GCT GCA GCT AGT CTC ATG GAT AGG GAT AAA ACC ATT ATG 
Tvr Phe Glu Ala Ala Ala Ser Leu Met Asp Arg Asp Lys Thr He Met 
225 230 235 

GCT GCT TCA TCA TGG AAT GAT AAT GGA CAG AAG CAG TTT GTG CAT GAT 
Ala Ala Ser Ser Trp Asn Asp Asn Gly Gin Lys Gin Phe Val His Asp 
240 245 250 

CCC TAT GCG CTA TAC CGA TCA GAT TTT TTT CCT GGC CTT GGG TGG ATG 
Pro Tyr Ala Leu Tyr Arg Ser Asp Phe Phe Pro Gly Leu Gly Trp Met 
255 260 265 



CTC AAG AGA TCG ACT TGG GAT GAG TTA TCA CCA AAG TGG CCA AAG GCT 



1 




1 Leu Lys Arg Ser Thr Trp Asp Glu Leu Ser Pro Lys Trp Pro Lys Ala 

2 270 275 280 

4 TAC TGG GAT GAT TGG CTG AGA CTA AAG GAA AAC CAT AAA GGC CGC CAA 1034 

5 Tvr Trp Asp Asp Trp Leu Arg Leu Lys Glu Asn His Lys Gly Arg Gin 

6 285 290 295 300 

8 TTC ATT GCA CCG GAA GTC TGT AGA ACA TAC AAT TTT GGT GAA CAT GGG 1082 

9 Phe He Ala Pro Glu ValCys Arg Thr Tyr Asn Phe Gly Glu His Gly 
10 305 310 315 

12 TCT AGT TTG GGA CAG TTT TTC AGT CAG TAT CTG GAA CCT ATA AAG CTA 1130 

13 Ser Ser Leu Gly Gin Phe Phe Ser Gin Tyr Leu Glu Pro He Lys Leu 

14 320 325 330 
15 

16 AAC GAT GTG ACG GTT GAC TGG AAA GCA AAG GAC CTG GGA TAC CTG ACA 1178 

17 Asn Asp Val Thr Val Asp Trp Lys Ala Lys Asp Leu Gly Tyr Leu Thr 

18 335 340 345 
1 9 

2 0 GAG GGA AAC TAT ACC AAG TAC TTT TCT GGC TTA GTG AGA CAA GCA CGA 122 6 

21 Glu Gly Asn Tyr Thr Lys Tyr Phe Ser Gly Leu Val Arg Gin Ala Arg 

22 350 355 360 
2 3 

24 CCA ATT CAA GGT TCT GAC CTT GTC TTA AAG GCT CAA AAC ATA AAG GAT 1274 

25 Pro He Gin Gly Ser Asp Leu Val Leu Lys Ala Gin Asn He Lys Asp 

26 365 370 375 380 
27 

2 8 GAT GAT CGT ATC CGG TAT AAA GAC CAA GTA GAG TTT GAA CGC ATT GCA 1322 

2 9 Asp Asp Arg He Arg Tyr Lys Asp Gin Val Glu Phe Glu Arg He Ala 
30 385 390 395 

3 2 GGG GAA TTT GGT ATA TTT GAA GAA TGG AAG GAT GGT GTG CCA CGA ACA 1370 
3 3 Gly Glu Phe Gly He Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Thr 

34 400 405 410 

3 6 GCA TAT AAA GGA GTA GTG GTG TTT CGA ATC CAG ACA ACA AGA CGT GTA 1418 

37 Ala Tyr Lys Gly Val Val Val Phe Arg lie Gin Thr Thr Arg Arg Val 

38 415 420 425 

3 9 

4 0 TTC CTG GTT GGG CCA GAT TCT GTA ATG CAG CTT GGA ATT CGA AAT TCC 14 66 

41 Phe Leu Val Gly Pro Asp Ser Val Met Gin Leu Gly He Arg Asn Ser 

42 430 435 440 
43 

44 TGA TGCAAAACAT AT G AAAG GAA AAGAAG AT T T TGGACCGCAT GCAGCCTCCT 1519 
45 

46 445 
4 7 

4 8 TCTAGCAGCT GTTAGGTTGT ATTGTTATTT ATG GAT GAG T TTGTAGAGCG GTGGGGTTAA 157 9 

4 9 

5 0 CTTTAACAGC AAGGAAGCTC TGGTGACCAG GCTGATTGGC TTAGAAGTTA TGGGAACCCC 1639 

52 TTGAAAGGGT CAGGGTTAAA TAT AT TTC AG TTGTTTTATT AGT GAT TAT C TTGTGGGTAA 1699 

54 CTT AT ACG AA TGCAAATCAT TCTATGCAGT TTTTCTTCGT CCCACTTGTT TTGGCTTCTC 1759 

5 6 TATTGCTAGT GTACATATCT CTTCAAACAT GTACTAAATA ATGCGTGTTG CTTCAAAGAA 1819 
5 7 

5 8 GTAACTTTTA TTAAAAAAAA AAAAAAAAAC TCGAG 1854 
59 

60 

61 (2) INFORMATION FOR SEQ ID NO: 6: 
62 

6 3 (i) SEQUENCE CHARACTERISTICS: 
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]_ (A) LENGTH: 444 amino acids 

2 (B) TYPE: Amino acid 

3 ( D) TOPOLOGY : Linear 
4 

5 (ii) MOLECULE TYPE: Protein 

6 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
7 
8 



10 



12 
13 
14 



19 
20 



Met Ala Arg He Ser Cys Asp Leu Arg Phe Leu Leu He Pro Ala Ala 



9 1 5 10 15 



11 Phe Met Phe He Tyr He Gin Met Arg Leu Phe Gin Thr Gin Ser Gin 



20 25 30 



Tyr Ala Asp Arg Leu Ser Ser Ala He Glu Ser Glu Asn His Cys Thr 
15 35 40 45 

1 6 

17 Ser Gin Met Arg Gly Leu He Asp Glu Val Ser He Lys Gin Ser Arg 

18 50 55 60 



He Val Ala Leu Glu Asp Met Lys Asn Arg Gin Asp Glu Glu Leu Val 
21 65 70 75 80 

22 

Gin Leu Lys Asp Leu He Gin Thr Phe Glu Lys Lys Gly He Ala Lys 
85 90 95 



23 
24 
25 

26 Leu Thr Gin Gly Gly Gin Met Pro Val Ala Ala Val Val Val Met Ala 

27 100 105 110 
2 8 

2 9 cys Ser Arg Ala Asp Tyr Leu Glu Arg Thr Val Lys Ser Val Leu Thr 
30 115 120 125 

31 

32 Tyr Gin Thr Pro Val Ala Ser Lys Tyr Pro Leu Phe He Ser Gin Asp 

33 130 135 140 
34 

3 5 Gly Ser Asp Gin Ala Val Lys Ser Lys Ser Leu Ser Tyr Asn Gin Leu 
36 145 150 155 160 
37 

3 8 Thr Tyr Met Gin His Leu Asp Phe Glu Pro Val Val Thr Glu Arg Pro 
39 165 170 175 

40 

41 Gly Glu Leu Thr Ala Tyr Tyr Lys He Ala Arg His Tyr Lys Trp Ala 

42 180 185 190 
43 

44 Leu Asp Gin Leu Phe Tyr Lys His Lys Phe Ser Arg Val He He Leu 

45 195 200 205 

4 6 

4 7 Glu Asp Asp Met Glu He Ala Pro Asp Phe Phe Asp Tyr Phe Glu Ala 
48 210 215 220 

4 9 

50 Ala Ala Ser Leu Met Asp Arg Asp Lys Thr He Met Ala Ala Ser Ser 

51 225 230 235 240 

53 Trp Asn Asp Asn Gly Gin Lys Gin Phe Val His Asp Pro Tyr Ala Leu 

54 245 250 255 
55 

5 6 Tyr Arg Ser Asp Phe Phe Pro Gly Leu Gly Trp Met Leu Lys Arg Ser 
57 260 265 270 

5 8 

5 9 Thr Trp Asp Glu Leu Ser Pro Lys Trp Pro Lys Ala Tyr Trp Asp Asp 
60 275 280 285 

62 Trp Leu Arg Leu Lys Glu Asn His Lys Gly Arg Gin Phe He Ala Pro 

63 290 295 300 
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2 Glu Val Cys Arg Thr Tyr Asn Phe Gly Glu His Gly Ser Ser Leu Gly 

3 305 310 315 320 
4 

5 Gin Phe Phe Ser Gin Tyr Leu Glu Pro He Lys Leu Asn Asp Val Thr 

6 325 330 335 
7 

8 Val Asp Trp Lys Ala Lys Asp Leu Gly Tyr Leu Thr Glu Gly Asn Tyr 

9 340 345 350 

11 Thr Lys Tyr Phe Ser Gly Leu Val Arg Gin Ala Arg Pro He Gin Gly 

12 355 360 365 
13 

14 Ser Asp Leu Val Leu Lys Ala Gin Asn He Lys Asp Asp Asp Arg He 

15 370 375 380 
1 6 

17 Arq Tyr Lys Asp Gin Val Glu Phe Glu Arg He Ala Gly Glu Phe Gly 

18 385 390 395 400 
19 

20 He Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Thr Ala Tyr Lys Gly 

21 405 410 415 
22 

2 3 Val Val Val Phe Arg He Gin Thr Thr Arg Arg Val Phe Leu Val Gly 

24 420 425 430 

25 

2 6 Pro Asp Ser Val Met Gin Leu Gly He Arg Asn Ser 

27 435 440 

28 



